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Abstract We start with Fodor's critique of cognitive science in [8]: he 
argues that much mental activity cannot be handled by the current 
methods of cognitive science because it is nonmonotonic and, there- 
fore, is global in nature, is not context-free, and is thus not capable 
of being formalised by a Turing-like mental architecture. We look at 
the use of non-monotonic logic in the Artificial Intelligence community, 
particularly with the discussion of the so-called "frame problem". The 
mainstream approach to the frame problem is, we argue, probably sus- 
ceptible to Fodor's critique: however, there is an alternative approach, 
due to McCain and Turner, which is, when suitably reformulated, not 
susceptible. In the course of our argument, we give a proof theory for 
the McCain- Turner system, and show that it satisfies cut elimination. 
We have two substantive conclusions; firstly, that Fodor's argument de- 
pends on assumptions about logical form which not all non-monotonic 
theories satisfy; and, secondly, that metatheory plays an important role 
in the context of evolutionary accounts of rationality. 
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Introduction 



Fodor has argued [8] that mental processes fall into two classes: those which 
are modular (that is, effectively encapsulated from other mental processes), 
and those which are not. The modular ones take place by means of local, 
syntactic operations on mental representations [8, pp. 18f.]: if they function 
properly, they will be insensitive to what may or may not happen outside their 
module. In fact, more than this is true: such processes are context invariant [8, 
pp. 34f.]. All they care about is the syntax of the mental representations which 
they are working on. In particular, they are monotonic: the validity of the 
inferences they perform is unaffected by adding extra mental representations 
to their context. 

The non-modular ones, by contrast, are not monotonic. Fodor's examples 
of such non-modular processes are, firstly, abduction, and, secondly, the belief 
change apparatus associated with abductive inference (including judgements 
as to the centrality of particular beliefs in one's belief system). These pro- 
cesses are, plausibly, non-monotonic and context-sensitive [8, pp. 33ff.]: and 
thus, Fodor goes on to argue, they cannot be described as operations on the 
(context-invariant) syntax of mental representations. 

Now one could quibble about all this in various ways: one could worry 
about whether Fodor had distinguished sufficiently well between abstract and 
concrete; syntax [22], one could wonder whether he had taken sufficiently into 
account the possibility of context dependence which was regular enough not 
to have the deleterious consequences that he wishes to avoid [30] - but, for 
all that, his argument is cogent and worrying. 

Rather than raising these sort of objections to Fodor's argument, it may be 
more illuminating to look at how the artificial intelligence community deals 
with this sort of problem. We will look at what is described in the AI com- 
munity as the "frame problem" : although this problem, which is mainly con- 
cerned with reasoning about change in the world, is somewhat different from 
the belief change problems that Fodor discusses, the two areas arc systemat- 
ically related [3], so that discussions of one should, mutatis mutandis, apply 
to the other. 
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Our results arc interesting. We investigate two approaches to the frame 
problem: one of them, the more mainstream one, is based on the idea of min- 
imising change, and probably is susceptible to critique in Fodor's manner. 
However, the other one, due to McCain and Turner, is probably not thus sus- 
ceptible: it shows illuminatingly how reasoning can be nonmonotonic without 
being catastrophically global. 

In arguing for this position, we have work, both technical and philosophical, 
to do. The McCain- Turner system is usually presented model-theoretically: 
we develop a proof theory for it, in order to show directly what reasoning in 
that system is like. In particular, we prove a cut elimination result, which can 
be regarded as showing how the system can support nonmonotonic inference 
and still be tractable. Proofs of these results are quite technical, and have 
been modularised into the appendix of this paper. 

We also have philosophical work to do. McCain and Turner present their 
system as a system of what they term "causal reasoning": I argue that the 
system can, much more plausibly, be presented as a logic of explanation. This, 
of course, brings it much closer to Fodor's concern with large-scale cognitive 
architecture, in which explanation ought to play a central role. So, in order 
to make this claim more plausible, I describe an interesting logic of argument 
due to Parsons and Jennings: this logic turns out to be merely a notational 
variant of a fragment of the McCain- Turner system. 

1 The Logical Background 

1.1 Basics Regardless of the details of the common approaches to reason- 
ing about action, they all have several things in common. They all assume 
that one should start with a logical description of actions and their effects: 
they all (or mostly all) assume that the logical apparatus should deliver a 
description of the sequence of states of affairs resulting from a given sequence 
of actions. We will, for the sake of simplicity, assume that we have linear 
discrete time, indexed by the natural numbers: propositions will have an ar- 
gument place for the time, into which terms standing for temporal instants 
can be substituted. Devotees of more sophisticated approaches - for exam- 
ple, of Reiter's elegant situation calculus formulation [27] - are welcome to 
translate my oversimplified notation into their variant. 

1.2 The Frame Problem A fundamental difficulty behind using logic to rea- 
son about action in this way is what is known as the frame problem. The 
problem is that, given the state of the world before an action, there are many, 
merely "logically possible", states of the world after it; but most of these 
states are physically unrealistic. This remains so even if we give what seem 
to be perfectly adequate logical formalisations of actions. 

Here is a famous example of what is at issue. 

Example 1.1 (The Yale Shooting Problem) Suppose that we describe a state 
of the world at a particular time t by propositions alive and loaded. The world (a 
particularly simple one) contains a victim, a gun (and, presumably, a shooter, 
but he or she can remain unformalised) . aiivet will say that some victim is still 
alive at t, whereas loadedt will say that a gun is loaded at that time. Consider 
actions shoot and wait I wait has no effect, whereas shoot, if the gun is loaded. 
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kills the victim. So wc can specify the shoot action as follows (for the sake of 
simplicity, we assume that every action takes one unit of time) : 

shoot^ A loaded^ K ~ialive^_|_i (1) 

Suppose now we have the following scenario: the gun is loaded at time 0, then 
there is a wait action, and the gun is then shot. So the following sequence of 
truth values is compatible with our specification: 



t 





1 


2 




wait 


shoot 




alive 


T 


T 




loaded 


T 


T 





(2) 



Unfortunately, so is this: 
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wait 


shoot 




alive 
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loaded 
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_L 


_L 



(3) 



However one might wish to diagnose the situation here, it is clear that some- 
thing has gone wrong: logic, supplemented by unproblematic-seeming axioma- 
tisations of the effects of actions, generates more solutions to these problems 
than it should. We need extra constraints, and there have been at least 
two attempts to say what sort of constraints we should have: a mainstream 
approach, based on the idea of minimisation, and an alternative approach, 
rather less clearly formulated, but technically more viable and more interest- 
ing. As I shall argue, the alternative approach is, in fact, based on the idea 
of explanation. 

1.3 Minimisation-Based Approaches We first outline, and criticise, the 
minimisation-based approaches: this will not only show some of the issues 
at stake, but it will also give an example of an approach which is, arguably, 
susceptible to Fodor's critique. 

The basic idea behind these approaches is simple: that we should pick 
those histories which minimise the amount of change. One can, presum- 
ably, motivate this criterion as follows: the ax;iomatisations of actions give 
the changes which must take place if the actions are to be performed, and 
one wants exactly thc;sc! changes and no others. Hence one should have a 
minimal amount of change. This idea is very close to Lewis' treatment of 
counterfactuals [12; 14; 13], and thus has a respectable philosophical pedi- 
gree. It can, furthermore, be supported by a mathematical argument: a wide 
class of non-monotonic logics can be specified by giving a preference relation 
on models of some monotonic theory, and then selecting only those models 
which are minimal according to the preference relation [4]. So, this approach 
seems to have the advantage of a strong philosophical motivation, together 
with mathematical generality. 

However, problems arise when one tries to make this approach more spe- 
cific. Example 1.1 shows the first of these problems: there are two histories 
in that example, and the wrong one has one change of truth value, while 
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the correct one has two changes. Even if we minimise by looking at contain- 
ment relations between sets of changes, rather than naively counting them, 
the problem persists. So minimisation often gives incorrect results. One can, 
perhaps, fix it, but many of the fixes (for example, preferring later changes to 
earlier ones) seem to be merely ad hoc. 

The second problem is this: what does one mean by "amount of change" ? 
This is, admittedly, a vague description, but it is not clear how to make it 
any less vague. The usual approach is this: one fixes a language, and then, 
for each history, one fixes the set of truth- value changes (that is, the set 
of ordered pairs consisting of a time and a primitive of the language which 
changes truth- value at that time). Containment between these sets gives 
a preference relation between histories: those histories are to be preferred 
which have minimal sets of truth-value changes. One can give this procedure 
a much more sophisticated and formal definition: it is generally known as 
circumscription [16], and is widely used in the AI community. 

The problem with this approach is that it is strongly language-dependent: 
it is possible to have equivalent languages, with different primitives, which 
give different preference relations according to this definition. One can, by 
a suitable choice of primitives for the language, make the preference relation 
come out any way that one wants [35]. In more technical language, the en- 
tailment relations which circumscription yields are not preserved by uniform 
substitution. Failure of imiform substitution is (pace Makinson [18]) a cause 
of concern, for two reasons. Practically, it makes the use of these theories 
rather problematic: there may well be some magical set of primitives which 
make the predictions come out right, but one is given no guidance as to what 
it might be. And, theoretically, this remains a worry: closure under uniform 
substitution is motivated by a concern that mathematics should be more than 
mere syntax, that is, that the entities which mathematics and logic describe 
should be independent of our choice of the language - the "coordinate frame" , 
as it were - which we use to describe them. 

Now Lewis works with possible worlds, which, for him, are first class indi- 
viduals, and his closeness relation between possible worlds - which corresponds 
to our notion of minimising change - is simply given: he does not say how to 
work it out, and, in particular, how to work it out on the basis of physically 
realistic measurements. Consequently, he does not explicitly face the prob- 
lems that we have described above. However, we should note that his account 
may well be susceptible to similar problems: it may work in the abstract, but, 
if we require that the nearness relations between worlds should be capable of 
evaluation in terms of continuously diflferentiable functions of measurements 
made in those worlds, we may end up with a theory incompatible with physics 
[34]. Lewis' approach turns out to be, in Fodor's terms, global: we have to 
know about all of the possible worlds that there are, and we have to know 
these worlds directly as individuals, in order to make causal inferences. 

The difficulty with Lewis' account comes from a quite general result (specif- 
ically, from what is known as Noether's theorem) which has to do with the 
invariance of the laws of physics under coordinate transformation. So, concep- 
tually, these considerations are not infinitely far removed from the problems 
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that circumscription has with invariancc under uniform substitution: and the 
problems with this approach seem to be quite deeply rooted. 

It is, of course, not impossible that one could modify circumscription so 
as to allay these worries, or, indeed, that Lewis' account of coimtcrfactuals 
could be made specific enough to work with physical laws. However, these are 
difficult problems, and they may well be better illuminated by an alternative 
approach. 

1.4 Explanation- Based Approaches Consider the bad history given in (3). 

We might attempt a critique of it as follows: the problem is not that there 
are too many changes, but that we cannot explain one of the facts - namely, 
the falsity of loaded at time 1. All of the other facts in that history can be 
explained, either because of the effects of actions or because they are facts 
unchanged from the previous time; but not this one. So a good constraint to 
put on our logic might be to require that every fact should be explained. 

This approach might have a good deal to recommend it. A good notion of 
explanation would, one thinks, be language-, or coordinate- frame-, indepen- 
dent: the translation of an explanation would surely still be an explanation. 
Furthermore, it has a pleasing directness: human beings like us perform a 
lot of common sense causal explanation, and reflection on this process might 
give us useful data for formalising such explanations. We might also cite the 
philosophical tradition: since the time of Aristotle, at least, causality and 
explanation have been very closely linked [1, p. 32], [7]. 

In order to carry this programme forward, we will have to have a certain 
amount of technical apparatus. We will have to have a notion of explanation: 
the things to be explained will be facts, and we should be able to express, of 
any particular fact, whether it is explained or not. These notions should be 
closed under uniform substitution: that is, the translation of an explanation 
should be an explanation. Such an approach to reasoning about action is, 
indeed, possible: it was formulated by McCain and Turner, and we will study 
it in the remainder of this paper. 

2 McCain and Turners' System 

Their treatment [21] of the frame problem goes as follows. Suppose that we 
have a language £, which will have the connectives of the classical preposi- 
tional calculus. McCain and Turner then consider a binary sentential opera- 
tor: let us write it ■>■. Applications of this operator - of the form ip >iiL' are 
called "causal rules", and they can, in McCain and Turner's treatment, be 
regarded as purely metatheoretic assertions: they read them as causes ip', 
but we will argue for a reading of V explains tp'. A collection of causal rules 
will be called a causal theory. Given a causal theory G in our language £, we 
define an operator (•)®, from models of £ to sets of sentences, as follows: 

M® = {ip\(p\>tp gD,M\\- ip}, (4) 

where " is closure under entailment. We now say that a model M of £ is 
causally explained iff it is the only model of M®. And we say that a propo- 
sition P is a consequence of iff it is true in all of the 0-causally explained 
models of £. 
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2.1 McCain and Turner's Causal Rules Wc will give a simplified version of 
McCain and Turner's causal theory: in order to make our exposition more 
direct, we will be presenting a version of our sequent calculus without the 
quantifiers (we could describe a version with quantifiers, but it would add 
extra technical complexity, and this extra complexity would not be germane 
to the main argument of this paper). 

Theories of this sort will talk of two sorts of entities, fluents and actions: a 
fluent is something which can be true or false at a particular time, whereas an 
action is something which can be performed (or not) at a particular time. We 
represent each of these by a series of temporally indexed propositional atoms: 

ft (iGN) for each fluent (5) 

at {t G N) for each action (6) 

Intuitively, ft is true iff / holds at time t, and at is true iff the action a occurs 
at time t. The atoms ft will be called the fluent atoms: a fluent literal will 
be a fluent atom or the negation of a fluent atom. 

We now describe our causal theory. We assume that the effects of the ac- 
tions can be formalised by giving, for each action o, a finite set of precondition- 
postcondition pairs. Such a pair consists of a precondition and a postcondi- 
tion, each of them a conjunction of fluent literals: the intuitive meaning of it 
is that, if the precondition is true in a situation, then, after the execution of 
the relevant action, the postcondition will be true in the successor situation. 
For example, the precondition of the shoot action is the loaded fluent, and the 
postcondition of the shoot action is -laiive A -iioaded. 

So, for each action a, and for each precondition-postcondition pair {f,g) 
belonging to that action, we add a series of causal rules: 

{ftAat)>gt+i (7) 

where / is the precondition of a, and g is the postcondition. We also have to 
say which actions occur at which times: so, if action a occurs at time to, we 
add the causal rule 

ato > ato (8) 

We also need to deal with fluents that remain unchanged (see [29]): so, for 
every fluent literal ft, we add the causal rule 

ft A ft+i > ft+i (9) 

Finally, we assume that a set of fluents is given which describes the initial 
situation: for each such fluent /, we add the causal rule 

/o>/o. (10) 

Given these rules, this procedure appears to work: that is, it yields correct 
solutions of the frame problem (the reader is invited to check that it works 
with Example 1.1). As a bonus, it is fairly efficient computationally. 
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2.2 The Road Ahead As it stands, though, McCain and Turner's definitions 
are not entirely perspicuous, either mathematically or conceptually. The defi- 
nition of causally explained models is mysterious: we would, thus, like a more 
illuminating treatment of the mathematics. Furthermore, it is difficult to ex- 
plain the conceptual role of o. For example, one of McCain and Turner's rules 
is /o ^ fo (10). But reading > as 'causes' here is simply implausible: proposi- 
tions - or the states of affairs which they denote - are not usually thought of 
as causing themselves. 

So, we will do two things in this paper. Firstly, we will give an alternative 
definition of (4): it can easily be reformulated as a relation between models, 
and relations on a set of models give modal operators. So we can, instead, 
define a suitable modal operator, □; given this operator, we can reformulate 
the definition of M® (4) as 

M® = {<p\M Ih Dip}; 

similarly, the relation 
is equivalent to 

(equivalent in the sense that the two relations give the same set of "causally 

closed" models). 

But, as well as merely technical reformulations, we also want to say what 
these constructions mean. We argue, then, that a better reading of ip>'ip is that 
'ip explains tp^ - and, correspondingly, Dijj can be regarded as a disjunction 
of all of the possible explanations of ip. In fact, this explanatory reading has 
been anticipated: Lifschitz [17, p. 451] paraphrases ipt'tp as "[^/j] has a cause if 
[ip] is true, or . . . [ip] provides a 'causal explanation' for [ijj]" . And, given this 
explanatory reading, reflexive rules seem far less contentious: (4) simply says 
that the proposition /o "explains itself" . Self-explanation is a good deal less 
problematic than self-causation, since explanations, after all, have to come to 
an end (or, of course, loop) at some point - see, for example Wittgenstein 
[37, §217]. In a similar way, McCain and Turner's rule for persistence (9) can 
be easily motivated in terms of explanation: a good explanation for a fluent 
being true at t+ 1 is that it was true at t, and that its truth value is unchanged 
between t and t+1. But it is hard to read this rule causally without, again, 
invoking self-causation. 

3 The Modal Logic 

3.1 Motivation Our system is defined proof-theoretically, but it was arrived 
at by reformulating McCain and Turner's original definition: our aim was to 
find a sequent calculus, with a good proof theory, which corresponded very 
closely to their model-theoretic construction. We proceed by progressively 

reformulating their work. 

Remark 3.1 (Notation) We should explain a few notational conventions, ip 
and tp will always stand for, respectively, the antecedents and consequents of 
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causal rules. General propositions will be written using lower case Roman 

letters, p, q,r, Sets of propositions will be written with upper case Greek 

letters, F, A; 6 will be used for causal theories, i.e. sets of causal rules. Lan- 
guages will be written £, with subscripts and superscripts. Models of our 
various theories will be written M, M', . . . And when, in the appendix, we do 
ordinal analysis we will use lower case Greek letters, but not (fi or ^. 

3.1.1 Reformulation 1: Model Theory First a definition: 

Definition 3.2 A causal theory 6 defines a relation Re on models of the 
language £ by 

M Re M' <=> for every 93 > G O, 

if M Ih ^p, then M' Ih V 

We then have: 

Lemma 3.3 For any p G £, 

M® h p <^ M' Ih p for all M' with M Re M' (12) 
Proof Note first that, for any model M', 

M' Ih M® <(4> for all > V e e. M Ih M' Ih V, 

so we have 

M^V-p ^ M' Vrp for all M' Ih 

O for all M' (for aVupt- ^ Q M h ^p ^ M' W xj)) 

^ M' \^p 
<4> for all M'.M Re M' ^ M' \\- p 

□ 

So we have 

Corollary 3.4 For any model M , M is causally explained according to Q iff 

{M} = {M'\M Re M'} 

3.1.2 Reformulation 2: Modal Models We can now reformulate McCain and 
Turner's theory in modal terms. 

Definition 3.5 Given, as above, a language £ and a causal theory 6, define 
a Kripke model Kq as follows: 

the language: is generated by £ together with the modal operator 
□ . 

the frame: consists of the set 9Jt of models of the non-modal language 

£, together with the accessibility relation Re- 
the forcing relation: is given by the usual Ih relation between elements 

of 971 and propositions of £, extended to modal formulae in the usual 
way. If we wish to be pedantic (but we rarely will), we could call the 
new forcing relation Ih^. 

Notice that we have: 
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Proposition 3.6 For each causal law P>Q, the modal sentence 

P^DQ (13) 

is valid at each world of K. 

Proof This follows directly from the definition of Re. □ 
Then we have: 

Proposition 3.7 For M e dJl, M is causally explained by 8 iff, as a world of 
the Kripke model K@, 

M Ih {Op ^p)A{p^ Dp) (14) 

for any p. 

Proof This is a standard modal reformulation of the condition 

{M} = {M'\M Re M'}; 

see, for example, van Benthem [32]. □ 

This result is pretty rather than useful: it refers to a particular Kripke model, 
and models are hard to present in any effective sense. 

We can make progress, though, by asking how other Kripke models are 

related to Kq. 

Definition 3.8 Let K be any other Kripke model of £□ such that, for any 
ipt'ip G Q, and any world w of K, 

w ip Dtp. 

Then define a map 

1] : worlds of K — > worlds of Kq 
by sending a world w of K to 

{p\p e £,w\h p} (15) 

Note that the sets of propositions defined by (15) are, by the properties of 
Kripke models, models of £: consequently, these sets of propositions are, in 
fact, worlds of K@. 

Proposition 3.9 Forw,w' worlds ofK, if wRkw' , then r]{w)R@r}{w'). 

Proof Suppose that ipt>^ G 0, and that r]{w) Ih (p. Then, by definition of 

rj, w Ih (p. However, by the assumption on K, w Ih 99 ^ □■0: since wRkw', 
we have w' Ih ip, and so, by the definition of rj, r]{w') Ih tp. This is so for any 
ifit'tp G Q, and thus we have Tj{'w)Rer]{'w'). □ 

Proposition 3.10 For any world w of K, and any non-modal proposition p, 
ifr]{w) Ih Dp, then w Ih Op. 

Proof Immediate from the above. □ 
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This gives us an intuition as to what characterises the model Kq: among all 
of the models which globally satisfy (13), it is the one in which □ is strongest 
(or, alternatively, the one in which the relation R is the largest). This shows 
us how to find a sequent calculus which corresponds to the model K0, and 
which, consequently, represents the McCain- Turner procedure. 

There is a standard way of getting such a sequent calculus (see, for example, 
Hallmis and Schrocder-Hcistcr [9, 10]). We first give ourselves right rules for 

□ corresponding to the constraints which □ must satisfy: then we find left 
rules which invert the right rules. The constraints which □ must satisfy are 
quite simple: ip Dij; must be a theorem, for every (pt> E Q. Rather more 
generally, if we have causal rules {'Pi>4'i}ieij ^ind if we have {f/'ilie/ ^ Pi then 
we must have {ipi\i^i h Up: this gives us a right rule for □ which ensures that 

□ is a K modality. The left rule will, correspondingly, invert this right rule: 
and we get the rules for □ in Table 1. And, as the remainder of this paper will 
show, our system does, indeed, give a sequent calculus for the McCain- Turner 
procedure. It allows us to recover the Kripke model Kq, which turns out to 
be the canonical model of our modal logic. 

3.2 The System Our system will be given by a sequent calculus, as in 
White [36]; it is given by the rules in Table 1, whose formulation depends on 
an underlying set of causal rules. As explained above, we will, for the sake of 
simplicity, present the propositional version of this system: we could present 
a first-order system, but at the price of extra technical complexity. 

We should note that this sequent calculus introduces a new language, £□ - 
an extension of £ by the modal operator □ - and a new consequence relation, 
1^. Our cut elimination theorem will show that £□ is a definitional extension 
of £, and that h is the restriction of to £: but, for the moment, we will be 
careful to respect the differences between the two entailment relations. 

Remark 3.11 The following considerations may make the calculus, and the 
connection between £□ and £, more perspicuous. If we were to assume that 
£ has arbitrary disjunctions, our cut elimination result would show that we 
could write 

Up = \J (fiiA...A(pk 
■!/'i,...,i/'fcbp 

In this case, the left and right rules for □ would simply be the left and right 
rules for a disjunction Vigj namely 

r,VftH.A ^ and r^^yp, (16) 
iei iei 

This explains the form of the side conditions ipi, . . . jtpk P in the left and 
right rules for UX: they take the place of i G / in the left and right rules for 
Vig/- Just as in those latter left and right rules, the conditions ^1, . . . , V'fe ks P 
occur positively in both left and right rules: they are side conditions and not 

premises. 

Remark 3.12 We should remark that the rules for □ are, in general, infinitary 
(and even when finite, the set of premises of DL can well be undecidable). The 
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Table 1 Sequent Calculus Rules 



Ax 

_L L TR 

r^A r^A 

LW RW 



r,pfeA rfep,A 

r,p,pfeA T\-nP,p,A 
LC RG 

r,pfeA rfep,A 

r^p,A r,pfeA 

— iL — iR 



r, A r -p, A 

r,p,(7feA rfep,A rfeg,A 

AL AR 



^,pAg^5A r[BpAg,A 

r,ph^A r,gfeA rfep,g,A 
VL VR 

T,pypk,A rk,pVq,A 

rkiP,A T,qkiA T,phBq,A 



T,p^qk>A Tk>P^q,A 
r fa A . . . A <^fe, A ipi,...,ipkk,p 



R 



□R 



□L 



r fB Dp, A 

{r, {-yi^ilie/,- A, {V'i}ie/j ^ J 

r, Dp fB A 
rfep™,A r',p"feA' 

multicut 

r,r'fe A,A' 

conditions on the rules: 

□R where, for all i, ipi >ipi is a causal rule. 

□L for each appropriate i, we have a causal rule ipi t> ipi, and 
where the {(/Si >'0i}ie/j a-re the only such finite sets of ips and 
tps, such that {'ipi}ieij ^□ P, that there are (J need not be 
finite) . 

multicut p" stands for n occurrences of p; m, n > 



qualification "in general" is important here: cut elimination will show that, for 
typical applications, this rule very often has a finite, decidablc (and, indeed, 
very tractable) set of premises. Indeed, many applications for example, that 
described in Appendix B - will only use the right rule for □, which is much 
more tractable. Furthermore, even when the rules are actually infinitary, the 



Causality, Modality, and Explanation 



13 



system is still very useful for mctatheoretical purposes: after all, the proof- 
theoretic use of infinitary systems has a very long and respectable history (see, 
for example, Fairtlough and Wainer [6, p. 164]). 

Example 3.13 Here is an example of how the left rule might plausibly be 

used. Michael Scriven [28, p. 61] describes the following argument pattern: 
we know that event p occurred. We know that the causes of p are a, b, or c: 
but we also know that neither a nor b occurred. So we conclude c. 

In our system, this would go as follows. We have p, and we are supposing 
the world to be explanatorily closed, so we can conclude Dp (here we are 
doing causal explanation, so our explanations really are causes). We can also 
conclude that 

□p = a V 6 V c; 
we also have -ife and -ic, so we conclude a. 

4 Consequences of Cut Elimination 

The proof of cut elimination is technical (it relies on ordinal induction), and is 
given in the appendix. It has important consequences, however, and we give 
some of them here. 

Corollary 4.1 The modal theory is a conservative extension of the non-modal 
theory: that is, if T and A are non-modal, we have 

r h A iff Tk,A. 

Proof Any proof of F h A is a proof of F fa A; however, any cut free proof 
of F Ib A cannot involve the modal rules, and is thus a proof of F h A. □ 

So, from now on we can safely ignore the distinction between h and f^- 

4.1 Semantics We can now prove a soundness and completeness theorem 
for our logic. We do this as follows: we first identify our model K@ of Defini- 
tion 3.5 as the canonical model of this modal logic. That is, semantic validity 
in this single model will entail syntactic validity. We can, then, prove sound- 
ness by showing that any syntactically valid sequent is also semantically valid 
when interpreted in the canonical model, and prove completeness by show- 
ing the converse. Recall that Kq has, as worlds, models of the non-modal 
language £, and that the accessibility relation is given by the relation Re of 
Definition 3.2. The corresponding modelling relation between worlds of Kq 
and sentences of £□ will, as in Definition 3.5, be written 

For convenience of notation, we identify models of £ with the sets of non- 
modal sentences true in them, and these arc just the maximal h-consistent 
sets of nonmodal sentences: let Tl be the set of all such models. Clearly, for 
A nonmodal and M e we have 

M\\-a iff aeM iff Mho. 

For this section, letters a, a',. . . will range over nonmodal sentences, whereas 
p,p', . . . will range over the whole of 

The proof of the following is immediate: 

Lemma 4.2 For any M and any non-modal a, M II-q a iff M \\- a. 
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Wc now define a semantic entailment relation, !=□: this will be like the usual 
notion of modal semantic entailment, but with respect to the single model 
Kq. As we have explained, Kq will turn out to be the canonical model of our 

logic, so the restriction to Kq is harmless. 

We first define semantic values for sentences in 

Definition 4.3 For sentences p in define their semantic values [p] C 
as follows: 

|a| = {M\M Ih a} for a a nonmodal atom 

l^pj = m- M 
IpAqj = MnM 
bv^l = MUM 

[□p] = {M|VM'.M Re M' ^ M' € Iqj} 

The following lemma is immediate: 

Lemma 4.4 For a nonmodal, [a] {M E Tl\M Ih a} 

Definition 4.5 For F, A C we say that 

FNn A iff fibl C UM- 
per qeA 

We also need to generalise our operation (•)® of (4) so that it can be applied to 
general subsets, rather than simply models, of £. We recall that, for M G 9}l, 
M® {ip\M \\- if ioi some ip > ip} . 

Definition 4.6 Let S C £: define 

5® =^ {a G & \ S \- tpi V . . . V ifk ior rules ipi>tpi, with \- a for all i} 

Note that, if 5 is a model, the new definition coincides with the old one. We 
need the disjunction of ips in order to prove the following lemma, which will 
be crucial: 

Lemma 4.7 // {Si}i^i is a set of subsets of £, with each Si closed under 
entailment, we have 

iei Vie/ / 

Proof The right to left containment is trivial. For the left to right contain- 
ment, suppose that a G Sf for all i. Thus, for all i, we have 

Sih ipi^,. . . , Lpi^^ with ^ a for all j. 
Then, since the Si are closed under entailment, we have 

But, by the compactness of £, wc can find a finite set of {ipi >ipi} with 
^ Si \- (fii, . . . ,(pn with ipi\- a for all i, 



Causality, Modality, and Explanation 



15 



and this establishes the result. □ 
We now prove soundness and completeness. 

Lemma 4.8 The Kq semantics is sound and complete for inferences of the 
form S ki Op, for F C £ and p G £□ ; that is, 

Proof We prove this by induction on the complexity of p: for p nonmodal, 
it is, by Corollary 4.1, simply soundness and completeness for the predicate 

calculus. The inductive steps for the nonmodal connectives arc quite standard: 
there remains the case when p — Dq. It will suffice to show that 

W = [jlfi,! A . . . A ^i,k,j 
i 

where, in the usual way, (pij>ipij for all relevant i, j, where ipi^i, ■ ■ ■ , ipi^ki 9 
for all i, and where these are all of such finite sets of causal laws that there 
are; the right to left inference gives soundness, and the left to right inference 
gives completeness. Note that the semantic values of Oq, for any q, are a 
union of the semantic values of nonmodals, and so 

[□gl= U H 

Now, if a is nonmodal, 

M c |ng] 

iff VM.M Ih a.VM'.M' Ih ^ M' \ba q 
expanding the definitions of [Dg] and of Re 

iff VM'.(3M Ih q.M' Ih M®) ^ M' Ihn q 

iff yM'.M' \\- (jA}Y ^ M'\\-aq 
(where {a} is the deductive closure of a) by Lemma 4.7 

ifiF (w)°l^o9 

by the inductive hypothesis. Now expansion of the definitions shows that 

(with the usual restrictions on the (pij), which was to be proved. □ 

Recall now that the forcing relation, Ihn, on the Kripke model is defined 
inductively using the structure of Kq. We can now show that, in fact, it is 
the same as provability. 

Corollary 4.9 For a model M G^M and p e Sin, 

MhuP iff Mh^p. 
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Proof By soundness and completeness, 

Mfep iff [MlC[p]; 

but [M] = {M}, so 

Mk,P iff M G M 
iff M\\-aP 

by definition of Ihn □ 

Theorem 4.10 The Kq semantics is sound for general sequents. 

Proof We prove this by induction over the length of a proof: the nonmodal 
rules are straightforward, and we use Lemma 4.8 for the modal rules. □ 

Lemma 4.11 For anyp e and for any M e DJl, either M p or M -^p. 

Proof We prove this by induction: since ks coincides with h on £, and since 

M is a model of £, the result is clearly true if p is nonmodal. The crucial step 
is to prove the result for propositions of the form Dq: here we argue as follows 

Suppose M ^ □ g 

then M,nq ^ ± 

and, by the left rule for □, this holds iff, for some set of causal rules I, 

which, since M is a model of £, in turn holds iff, for any i G 7, 

M fa 

and so, by the right rule for □, 

M Uq 

By cut elimination, no element of £□ can be both entailed and not entailed 
by M. □ 

Corollary 4.12 The maximal [^-consistent subsets of Zu are in one-to-one 
correspondence with the maximal \--consistent subsets of Z (i.e. the elements 
ofTl). 

Proof Intersection with £ gives a morphism in one direction: Lemma 4.11 
shows that every maximal h-consistent subset of £ can be uniquely extended 
to a maximal fs -consistent subset of □ 

Theorem 4.13 The Kq semantics is complete. 

Proof It suffices to show that, if F C £□, and if F then |F] is not empty, 
i.e. that there is some world M of Kq with M € [FJ. But, if we have F 
we can extend F to a maximal fa -consistent subset of £□: this will correspond 
to an element M of OJt. By Corollary 4.9, this is the world we want. □ 
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4.1.1 Interpolation and Finiteness We first prove the following interpolation 

theorem. 

Proposition 4.14 (Interpolation) // we have T Dp, A, for some p and 
r, A C then there is a set Aq C £ such that 

rfe Ao,A 

and, for each a G Aq, there is some b € £ such that 

a ^B and 

Proof We prove this by a straightforward induction over a cut free proof: 
because we might use the contraction rule on dp, the inductive hypothesis 
will be the same as that of the theorem but with the slightly weaker condition 
that r {Op)"^, A. The elements a of Aq will turn out to be finite conjunctions 
of rule bodies, whereas the 6s will be the corresponding conjunctions of rule 
heads. □ 

One of the key applications of our cut elimination theorem is in proving finite- 
ness results, such as the following: it is applicable to the modal implications 
that we will use in the next section for interpreting Parsons and Jennings' 
logic of argument, so it is quite significant. 

Corollary 4.15 If 

T\-np 

is provable in the sequent calculus, with T a set of nonmodals, then there is a 
finite set V C V, and nonmodal propositions A and B, such that 

r'ha, 

a fs and 
bkip. 

Proof Wc apply Theorem 4.14, which gets us a possibly infinite set A with 
r ^B A. But, in this case, V and A are both classical, so we can apply 
Corollary 4.1 to show that F h A. We can now apply compactness to find a 
finite Aq C A, and, taking disjunctions, we get a single a and b. □ 

5 Explanation and Argument 

As we have argued, McCain and Turner's theory seems to be a very natural 
formulation of explanation in general: although its original application may 
have been to a causal context, there is nothing about it which forces these 
explanations to be causal. Once we broaden our horizons to general explana- 
tion, we can bring this modal system into contact with other work: we give 
one example of this, the theory of Parsons and Jennings [23] (see also Parsons 
et al. [24]). We could give other applications: for example, where we have 
explanations, questions must also be in the neighbourhood, and thus we can 
also express a good deal of the formalism of Belnap and Steel [2] (see Harrah 
[11]) in terms of ours. 
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5.1 The Parsons and Jennings System Parsons and Jennings ([23]; sec also 
[24]) have described a consequence relation, I-acr, which is intended to cap- 
ture the practice of argumentation. The items which this system manipulates 
are ordered pairs (p, F), where p is a proposition and A a set of propositions: 
we will call such a pair an argument. Intuitively, p is the conclusion of an 
argument, and F is the set of its grounds. We will also call p the head of the 
argument, and F its body. 

The system is given in Table 2: here G is a set of basic arguments, and it 
plays exactly the same role in their system as a causal theory does in McCain 
and Turner's. Parsons and .Jennings write their system in natural deduction 
style, with introduction and elimination rules, using sequents of the form 
l~ACR {p, A): such a sequent says that A is an argument for p, given basic 
arguments O. 

(Note that we have interchanged the labels on the rules -il and RAA: 
Parsons and Jennings' original labelling is clearly a typo of some sort.) 

Table 2 The Parsons and Jennings System 



(p>r)ee . 

Axiom Tl 



ehACR(p,r) ehACR(T,0) 

ehACR(pA9,F) 

ehACR(p,r) ehACR(9,n ^ — ——aei 

■ Ai I-ACR [p, r) 

AE2 

e^ACR (g,r) 



ehACR(pAg,FUF') 



^ACR {p, r) l-ACR {p V q, F) 

0,(p,F)hACR(r-,F') 



0^ACR(pVg,F) 

VI2 0hACR(r-,F'uF") 



0^ACR(pVg,F) 

e,(p,0) hACR (^,r) ^ 0KACR(p,r)9hACR hP,r) 

0^ACR(-?>,r) " 0hACR(^,r) 

0,(p,0)hACR(g,r) ^ 0hACR(p,F)0hACR(p^g,r) 

0^ACR(p^g,r) ^ 0hACR(9,ruA) 

hACR (^, r) 0, {^p, 0) hACR {±, r) ^ ^ ^ 

EFQ RAA 

0^ACR(p,r) 0^ACR(p,r) 



E 
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5.2 Comparison with Our System We can now translate Parsons and Jen- 
nings' system into ours. 

Definition 5.1 Let Q \~acr (PiT) be a sequent in Parsons and Jennings' 
system. Its modal translation is the sequent 

r h DeP, 

where n@ is the modal operator defined by causal laws 

/\<i]>p \ (p,r)€e 

Since the Parsons and Jennings system is written in natural deduction style, 
some of the rules (for example I) manipulate the set of basic arguments: 
consequently, the modal operator in the modal translation will vary. We will, 
then, need the following lemma: 

Lemma 5.2 // □ is the modality associated to a set G of explanations, and 
if □' is that associated to QU {{ip,(fi)}, then define an interpretation 



by 



a : 




a{p) 




a(p A p') 


— a{p) A a{p') 


a{p V p') 


= a(j>) V a(j>') 


a{^p) 


= -.Q!(p) 


a{a'p) 


= {tp A D{i> —>■ a{p))) V Da{p) 



for p atomic 



for any p. Then, for any F, A C £□/ 

Fhn, A iff a(r)fea(A) 

Proof Wc check that F and oi{r) have the same semantic values (regarded 
as subsets of 9JI), and similarly for A; we then apply the soundness and 
completeness theorems. □ 

We have the following 

Proposition 5.3 The modal interpretation is sound; that is, each of Parsons 
and Jennings' axioms is translated into a tautology. 

Proof 

Ax: is 

ehACR(p,F) 

and this follows from our definition of the modal translation. 
A— I, -1— E, E: A — I, for example, is 

ehACR(p,F) ehACR(g,FO 

©^ACR(pAg,ruF') 

and this follows from the K tautology Up AQq \- Op A q. -i — E and 
— » — E are similar. 
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A-El, A-E2, V-Il, V-I2, EFQ: A - El, for example, is 

e^ACR(pAg,r) 

e hACR (p, r) 

and this follows from the K tautology h 0(0 A 6) ^ Da. A — E2, V - 11, 
VI2, and EFQ are similar. 
T— I: This is just 

h DT, 

a K tautology. 
V-E: This is 

ehACR(pvg,r) e,(p,r)hACR(r,ro e,{q,T)\-ACK{r,T") 
ehACR (r,r'ur") 

and this corresponds, in our system, to 

a^a0{pVq),b^ □eu{(p,r)}r-, c □eu{(q,r)}r- 

h 6 A c — > Dsr 

Wc can use the lemma to express nQy^p p-ji, and DeyKg p-ji, in terms 
of De ; some routine but tedious computation then reduces this case to 

□e(pVg),ne(p^r),ne(g^r)hner (17) 

which is a K tautology. 
RAA, -1— I: -i-I is 

e,(p,0) hACR 

e l-ACR hp, A) 

which corresponds to 

A □eu{(p.0)} -LI- ^ ^ Oq-'p; 
using the lemma on □eu{(j),0)}7 and some computation, reduces this to 

□ _L V □ (p ^_L) h a^p 

which is a K tautology. -i-I is similar. 
^-I: This is 

9, (p, 0) hACR (g, A) 
© ^ACR {p q, A) 

which corresponds to 

A □0u{(p,0)}g \-A^De{p^ q). 
The usual moves reduce this to 

Uq V U{p q)\- U{p —>■ q) 
again a K tautology. 

□ 

Completeness does not hold. This is for trivial reasons: all rules (except VE) 
of the Parsons and Jennings system leave the body of the argument intact. A 
trivial induction on the length of proofs will yield 
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Proposition 5.4 In any proof of 

Q^ACR{p,r), 

A must be a union of the bodies of rules in 6. 

Since the modal sequent calculus certainly does not satisfy this condition, we 
cannot hope for completeness. What we need to do is to be able to compose 
proofs in the Parsons and Jennings system with natural deduction proofs for 
the grounds of an argument: we could, theoretically, write down another set 
of rules for doing this. However, we only need one extra rule, which is this: 

Definition 5.5 Let classical \/E be the following rule: 

rhgVr ehACR(r,r'u{g}) ehACR(r,r"u{r}) 
VEC 

ehACR (r,rur'ur") 

where A h g V r is an entailment in classical natural deduction. 
We clearly have 

Proposition 5.6 The modal translation is sound for classical or-elimination. 
And we can also prove completeness: 

Theorem 5.7 The modal translation is complete: that is, given a proof of 

r h 0@p, (18) 

there is a proof, in the Parsons and Jennings system together with classical 
or-elimination, of 

e^ACR{p,T) (19) 
Sketch of proof We establish the following lemma: 
Lemma 5.8 If p is non-modal, given a sequent calculus proof of 

V'i,...,Vfe i-p, 

then there is a Parsons and Jennings proof of 

1- {p, Wi, ■ ■ ■,'Pk})- 

This lemma can be proved by first transforming the sequent calculus proof to 
a natural deduction proof, and then observing that the Parsons and Jennings 
rules mirror the rules of classical natural deduction. 

So now we can prove the theorem: we take a proof of A h Dqp, and from 
it derive a finite set I such that 

r h V ^ii A . . . A ^i,^ (20) 

0|-acr(p, {¥'»i,---,<Pi^}) for any? (21) 

We then use classical or-elimination in order to glue together (20) and (21). 

□ 
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Remark 5.9 As wc sec here, the natural deduction formulation is actually 
quite ambiguous as to what its premises are: in a proof of 

ehACR(p,r), 

are the premises the basic arguments G, or the grounds F for the argument 
which is to be established? Now the system is set up as if the premises are the 
set 6 of basic arguments, and this gives a sense of composition of arguments 
which is valid: that is, from 6 Kacr {p, T) and 6', {p, F) I-acr {q, F'), we can 
establish 6', 6 I-acr {q^^')- But this is not enough: we also want to regard 
the grounds of arguments as premises. 

The situation is clearly two-dimensional in something like Pratt's sense - 
he defines the dimension of a logic to be "the smallest number of variables 
and constants of the logic sufficient to determine the remaining variables and 
constants" Pratt [26]: the modal operator can be varied quite independently 
of the classical connectives, merely by altering the set of causal rules. Con- 
sequently, a formalism such as Masini's Masini [19] Masini [20] may well be 
more appropriate. 

Remark 5.10 This translation between sequent calculus and the Parsons and 
Jennings natural deduction is, in addition, not very sensitive to the structure 
of proofs on either side: natural deduction proofs tend to transform the con- 
clusion of the argument quite extensively before coming down to basic argu- 
ments. Sequent calculus proofs, by contrast, leave the conclusion unchanged 
until an application of UR, after which the proof is a matter of standard 
classical logic. In addition, the Parsons and Jennings system only represents 
a fragment of the full sequent calculus (namely, the entailments in which □ 
only occurs on the right). A natural deduction formulation of the entire se- 
quent calculus would be interesting, but would have to extend the Parsons 
and Jennings system quite considerably. 

6 Conclusion: Two Approaches to Nonmonotonic Reasoning 

There are two approaches to nonmonotonic reasoning: there is the generally 
accepted one, which may well be susceptible to Fodor's critique. It can be 
summarised in Brewka's words: 

To formalise human commonsense reasoning something different 
[from classical logic] is needed. Commonsense reasoning is frequently 
not monotonia. In many situations we draw conclusions which are 
given up in the light of further information. [4, p. 2] 

According to this view, nonmonotonic logic is applicable globally: it applies 
to all of our commonsense reasoning. There is, so to speak, a small frag- 
ment of our reasoning which happens to be monotonic: but commonsense is 
nonmonotonic by default. There would, then, be a large amount of reason- 
ing which was both nonmonotonic and global. This, if true, would be fairly 
catastrophic. There has been quite a lot of recent success in implementing 
nonmonotonic logic, but it is still true that, precisely because of this globality, 
performance scales quite badly [5] . So, if we are supposed to do it on a very 
large scale, it cannot be expected to perform well. 

Contrast this with the McCain- Turner system. There are two aspects that 
we need to examine: firstly, reasoning within the system itself, and, secondly, 
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the relation between the set of causal rules and the modal operator. The 
system itself is monotonic. It is a modal logic of a fairly well-known sort, and 
it has a sequent calculus which admits cut elimination; we can, then, search for 
proofs efficiently. The dependence of the modal operator on the set of rules 
ip> ip is, however, non- monotonic: if we add new rules, thus changing the 
modal operator, then we may invalidate modal inferences that we previously 
made. However, only the left rule for □ is thus sensitive; the right rule is not. 
Consequently, although our system is nonmonotonic, it is so in a quite limited 
way: it is not susceptible to the sort of "anything might entail anything else" 
worries that Fodor has. 

There is a final remark to be made. Many of the tractability results for 
our system do not follow directly from its definition, but are established on 
the basis of metatheory, and quite technical metatheory at that. Now this 
would, perhaps, be a fatal objection if we were supposed to adopt this system 
by means of introspection. However, if we are supposed to acquire styles of 
reasoning on the basis of natural selection in the manner of the cognitive 
science described in [8, ch. 5] - then none of this matters: the metatheoretical 
results apply to the system, and give it the evolutionary advantages that it 
arguably has, regardless of whether or not the early primates who, maybe, 
adopted these styles of reasoning were, or were not, familiar with technical 
results in proof theory. 
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In the appendix, wc have material that docs not fit naturaUy into the 
argumentative structure of the main paper: the cut ehmination, which is too 
technical, and some material on Turner's logic of universal causation, which 
relates what we have done to previous work. 

Appendix A Cut Elimination 

We now prove our cut elimination result. This needs a certain amount of 

machinery, because our system is, in generally, infinitary: proof trees may, 
therefore, be infinitely branching, and - since we cannot apply Zorn's lemma 
- there may be branches of infinite length above a given node. Because of this, 
we cannot assign a finite depth to each node, and, consequently, we cannot 
prove cut elimination in the usual manner, that is, by an induction on both 
the depth and the complexity of the cut formula. We can use an induction, 
but it must be an induction over ordinals: there is a machinery of ordinal 
analysis, which is used for the proof-theoretic analysis of infinitary systems (for 
example, arithmetic with the w-rule). We will generally follow the treatment 
in Pohlers [25], with the modifications necessary because our system is two- 
sided, not one-sided, and because our sole infinitary connective is □ rather 
than the infinite conjunctions and disjunctions which Pohlers investigates. 
Our case, in fact, is rather simple, because, if we regard □ as a possibly 
infinite disjunction of finite conjuncts of non-modal formulae, the complexity 
of the disjuncts is bounded a priori by u. 

In this section, since we want to treat entailments uniformly, we will write 
rule applications as follows: 

rfe A 

where / can have zero, one, or two members (for the standard rules) or an 
arbitrary number (for DL). 

We first define the rank of a formula: 

Definition A.l If £ let its rank rk(F) be the ordinal defined inductively 
as follows: 

1. rk(F) = if is atomic, 

2. rk(^i^) = rk(Va;.F) = rk(3x.i^) = rk(i^) + 1 

3. rk(Fi A F2) = vk{Fi V F2) = rk(Fi ^ Fi) = max(rk(Fi), rk(i^2)) + 1 

4. rk(nF) = 0. 

Note that rk(nF) is independent of F: this is because, as we have remarked, 
we can give an a priori bound on the disjuncts that UF can be thought of as 
expanding into. We have 

Proposition A. 2 For any F e rk{F) < w. 

Proof Immediate. □ 
We define the following classification of inference rules: 

Definition A. 3 The finitary rules will be all of the rules apart from DL. The 
non-cut rules will be all of the rules except cut. 
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Next, wc define entailment symbols annotated with both the rank of cutfor- 
mulae and the depth of the proof tree. There are two classes of inference rules: 
the finitary rules, and DL. DL is the only infinitary rule; our proof trees, then, 
will consist of portions where finitary rules are applied, separated by applica- 
tions of DL. We will treat the two kinds of rule application differently, and 
we will, therefore, use two ordinals for measuring the depth of the tree. C will 
mcasiire the number of applications of DL above the root of the tree which 
have proofs involving cuts above them; it will, in general, be infinite, a will 
measure the number of finitary rule applications above the root of the tree, 
but will be reset to zero after each application of DL: it will be finite, p will 
measure the rank of cut formulae, and will likewise be reset to zero after any 
application of DL: it will, therefore, be finite. The last two rules will make 
the annotations fit smoothly with our inductive arguments, which will first 
over ^, then over p, then over a. 

Definition A. 4 Define the entailment relations h^' as follows: 

1. If r A is an instance of Ax, of _L L, or of TR, then 

r^rA 

2. If we have an instance of a finite non-cut inference rule 

rfe A 

and if, for all i € /, Fj Aj where, for all i, ai < a, then 

Fh-^A. 

3. If 

rfe A 

is an instance of DL, and if, for all i, Fj h^"*^ A^ where Q < ^, then 

rC^A. 

4. If 

Ffe A 

is an instance of DL, and if, for all i, Fj 1-^."''' Aj where either some 
> or some Pi > and where Ci < C for all i, then 

Fh^A. 

5. If 

Fh^^'^AA and T',A^;'^ A' 
with ai,a2 < a and vk{A) < p, then 

F,F'h;'^A,A'. 

6. If F h^' ^ A, and it a < P, p < a with /3 and a finite, then 

Fhf'^A 
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7. If r h^' A, and if C < then, for any finite /3 and 



The following lemmas follow by a trivial induction: 

Lemma A. 5 // F A, </ten, /or some finite a and p, and some ordinal (, 
Lemma A. 6 A proof T \^ A is cut free iff it has an annotation of the form 

rC° A. 

Theorem A. 7 Given a McCain-Turner causal theory, the corresponding 
modal system satisfies cut elimination. 

We prove this by the following lemmas: they arc very much the same as the 
corresponding lemmas in [25, pp. 60ff.]. Note that a#/3 stands for the so- 
called natural sum of the ordinals a and /? ([25, p. 43]): the relevant facts 
that we shall use are that it is defined for any pair of ordinals a and (3, and 
that it is strictly monotonic in both arguments. 

Lemma A. 8 // 

rh^'''X,A and 
and if rk{X) = p, then 

,^a+/3,C#^ A, A' 



T',X^p"' A' 



Proof Wc prove this by induction, first on C#r], then on a + p. There are 
four cases. 

1. One of the premises is an axiom, ± L, or TR, and X is not principal in 
it: in this case, r,r' A, A' is an instance of the the same rule, and, 
by the definition of F, T' '''^^ A, A', we have the first case of the 
result. 

2. X is non-principal in one or the other of the premises (suppose wlog 
the left one) , and that premise is the conclusion of a finitary rule. Then 
the prooftree on the left looks like 

n,: 



Ti^;'^X,Ai 



Fif^XA 

Since, for all i, < a. (and, consequently, a..; + [3 < a + /?), we can 
assume the hypothesis inductively for cuts between ^ X, Ai and 

T',X \^'''^ A', obtaining proofs H^: we complete the proof as follows: 

Hi 



F,,r'h;-+^'^*^ A,, A' 
F,r'h-;+'''^*'' A, A' 
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3. X is non-principal (wlog on the left), the bottom sequent on the left 
is the conclusion of DR, all of the premises on the left have ( = p = 0, 
and the sequent on the right has r] = p = 0. So, the prooftree on the 
left looks like 

n,: 



We can apply the lemma inductively, obtaining proofs of 

, ai+/3,0 

we now complete the proof as follows 

n,: 



r,,r'h?'+^'°A„A' 



r,r'hr A,A' 

4. X is non- principal (wlog on the left), the bottom sequent on the left is 
the conclusion of OR, and we do not have the previous case. So, the 
prooftree on the left looks like 



Ti\^'^' X,Ai 



rh-;'^x,A 



iei 



where, for all i, Q < ^. We can apply the lemma inductively, obtaining 
proofs of rj,r' Aj, A', where, for each i, Q < Ci < Ci ^ind 

where p- = max(/9j, rj): we now complete the proof as follows 



Ti,T'\^'^' A,, A' 



r.r'h-r A,A' 

and apply case 6 of Definition A. 4 to adjust the values of a and p. 

5. X is principal on both sides; there are various cases, depending on the 
principal connective. We treat two of these cases: the others are very 
similar. 
— >: The proofs are 



and 



Z,A 



R 



r'h^"''y,A' r',zhf'''A' 



r',Y 



Zh^'^A' 
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with ai < a and /3i , /?2 < P- We transform this into 

n Hs 



-p p/ ipi+/3i. C#'? 



TXC^''*^ A, A' 

where n is produced from Hi and 112 by the inductive hypothesis: 
we have rk(F),rk(Z) < p, Q!i#/3i < q;#/3, and /3i < Q!#/3, so the 
final inference is justified. 
□: The proofs are 

n' 



n 



rh-;^'^y)iA...Ayfc,A 
rh"'^ DY,A 



and 



□R 



I r , (^,1 , . . . , hp A J 



r',nFhf^A' 



□L 



where a\ < a, r]i < r] for all i. 

Wc construct a new proof as follows: the tuple (pi, . . . ,ipk must 
match one of the tuples indexed by /: say it matches iq. So we 
now have proofs 



n 



and 



\ AL 

r',<^iA..JA<p,h^+'=-''''- A' 



■ AL 



We now have a proof of 

r,r'h",'''''° A, A', 

and < ry, so we can now apply case 7 of Definition A. 4, and 
obtain the result. 

□ 

Proof of Theorem A. 7 Suppose that we have a proof involving cuts, i.e. a 
proof of a sequent 

r h;+^, A (22) 

We prove that this proof can be replaced by a cut free proof of the same 
sequent by an induction. The inductive hypothesis will be that, given such a 
proof, there is a cut free proof 

rho'°A 

with some a' . We perform the induction first on ^, then on p, then on a. 
There are four cases. 
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1. If the last inference is an application of DL, then the premises are 

(X- C- 

of the form Fj h;,^" ' with, for all i, < C- We can assume the 
result inductively, obtaining cut free proofs of the premises, and we 
can complete the proof as follows: 

which is a cut free proof of (22). 

2. If the last inference is a cut of rank less than p, then the premises will 
have the same value of C and p, but smaller values of a: inductively, 
we can assume that these premises have cut free proofs. So we now 
have a proof as follows: 

r'h-o''°x,A' r",xho"'°A" 



r',r"h^;'' A',A" 

with p' < p + 1. So, by our inductive hypothesis, we can assume the 
result. 

3. If the last inference is a cut of rank p, then its premises will be of the 
form 

r'h;;fA' and r"h;v/A" 

We can apply the result inductively to obtain cut free proofs of the 
premises, which (after applying Definition A. 4 case 6) can be assumed 
to be of the form 

r'hf'°A' and r"hf'°A"; 
we then apply Lemma A. 8 to obtain a proof of 

We have possibly reduced the value of C) and certainly reduced the 

value of p, so wc can assiimc the result by the inductive hypothesis. 

4. If the last inference is an application of a non-cut finitary rule, then 
its premises must have the same value of C and p, but smaller values 
of a, than the conclusion: wc can again apply the result inductively to 
the premises, and then apply the rule to the premises, giving a cut free 
proof of (22). 

□ 

Appendix B Turner's Logic of Universal Causation 

As we have indicated. Turner [31] also has a modal system which he uses 
as a mctathcory for the McCain- Turner procedure. His treatment has the 
following features: 

1. We are given an S5 modal operator, written C, and a theory T in the 
language of that operator 

2. Recall that a Kripke model of S5 is 

(a) a set W of worlds. 
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(b) a truth-functional forcing relation between worlds and proposi- 
tions of the no n- modal language, and 

(c) an equivalence relation on the set of worlds. 

We then say that an S5 Kripke model of the theory T is causally 
explained if 

(a) it has a single world w 

(b) if any model of T has a set of worlds W which is a superset of 
{w}, with a forcing relation extending that for w, then {w} is a 
single equivalence class in the larger model. 

(Equivalcntly, the inclusion of w in W is what is called a p- 
morphism; see van Benthem [32, 33]). 
3. Then we say that a proposition is causally explained if it is forced at 
the unique world in all causally explained models. 

Now this definition of a causally explained model involves not simply a model, 
but all models which contain it (in some appropriate sense of 'contain'): it is 
not obvious that this definition can be replaced with one which only talks, in 
the standard way, about theoremhood in a single model. In fact, as we shall 
see, it cannot. 

Example B.l Consider a language with a single atom p, and two theories in 
that language: 

T, = {p^ Cp} (23) 
T2 = {p^ Cp, -^p C^p} (24) 

Then Ti has a single causally explained model: its world forces p. T2 has two 
caiisally cxplain(xi models, one with a single world forcing p, and one with a 
single world forcing -^p. Consequently, the proposition p is causally explained 
by Ti, but not by T2. 

This has, as a consequence, 

Proposition B.2 There is no set of propositions T such that, for any S5 
theory T, 

p is causally explained by T T,T I-55 p. 

Proof If there were such a set of propositions F, then the relation 

p is causally explained by T 

would be monotonic in T: as Example B.l shows, it is not. □ 

By contrast, our logic is monotonic: it is given by a sequent calculus of the 
normal sort, with the weakening rule. 

Example B.3 We should consider the analogue of Example B.l in our system. 
The two theories would correspond to two different modal operators, Di and 
□2, and we have : 



Dip ^p Di^p 

D2P = p D2^P = ^P 
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The two sets of propositions Fj are as follows: 



T2 




This is in accordance with the expected results: for Ti, only the world which 
forces p is ctmsally explained, and so Fi (essentially) contains only p. For T2, 
both worlds are causally explained (both the one which forces p and the one 
which forces -ip), and r2 is essentially vacuous. 

So, although our modal logic and Turner's yield the same sets of causally ex- 
plained models (and hence the same sets of causally explained propositions), 
they have radically different properties. Ours is monotonic and has a well- 
behaved proof theory: Turner's is non-monotonic and, as it stands, does not 
have a proof theory. Although Turner's system is formulated using S5, Propo- 
sition B.2 shows that we cannot, in any obvious way, use the proof theory of 
S5 as a proof theory for his system. 
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